issue/1090: QY机器添加flash attention by xgqdut2016 · Pull Request #1099 · InfiniTensor/InfiniCore

xgqdut2016 · 2026-03-20T07:29:58Z

include/infinicore/adaptor/aten_adaptor.hpp

PanZezhong1725 · 2026-03-20T07:33:58Z

include/infinicore/adaptor/aten_adaptor.hpp

    } else if (device.getType() == Device::Type::CPU) {
        return at::Device(at::kCPU);
+    } else if (device.getType() == Device::Type::QY) {
+        return at::Device(at::kCUDA, device.getIndex());


这代码nv能编译吗

qinyiqun · 2026-03-24T05:39:31Z

xmake/qy.lua

+
+local INFINI_ROOT = os.getenv("INFINI_ROOT") or (os.getenv(is_host("windows") and "HOMEPATH" or "HOME") .. "/.infini")
+
+local FLASH_ATTN_QY_CUDA_SO_CONTAINER_DEFAULT =


不要用hard code 的路径

qinyiqun · 2026-03-24T07:18:01Z

src/infinicore/ops/mha_kvcache/mha_kvcache_flashattn.cc

-    auto v_cache = infinicore::adaptor::to_aten_tensor(p->v_cache);
-    auto seqlens_k = std::optional<const at::Tensor>(infinicore::adaptor::to_aten_tensor(p->seqlens_k));
-    auto block_table = std::optional<at::Tensor>(infinicore::adaptor::to_aten_tensor(p->block_table));
+    // FlashAttention kernels expect standard dense layout (contiguous last dimension).


修改代码涉及其他平台时，需要重启一个编译选项，单独修改，不能影响原有其他平台代码

xgqdut2016 requested a review from a team March 20, 2026 07:29

PanZezhong1725 requested changes Mar 20, 2026

View reviewed changes

xgqdut2016 added 2 commits March 24, 2026 09:41

issue/1090: qy flash-attention

4f20b18

issue/1090: success link flash-attention.so

eef0acb

xgqdut2016 force-pushed the issue/1090 branch from 7e5b801 to eef0acb Compare March 24, 2026 01:43

xgqdut2016 added 2 commits March 24, 2026 09:59

issue/1090: qy flash guard

b4d88ec

issue/1090: success qy flash

4c85cdc

qinyiqun reviewed Mar 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

issue/1090: QY机器添加flash attention#1099

issue/1090: QY机器添加flash attention#1099
xgqdut2016 wants to merge 4 commits intomainfrom
issue/1090

xgqdut2016 commented Mar 20, 2026 •

edited

Loading

Uh oh!

Uh oh!

PanZezhong1725 Mar 20, 2026

Uh oh!

qinyiqun Mar 24, 2026

Uh oh!

qinyiqun Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		local INFINI_ROOT = os.getenv("INFINI_ROOT") or (os.getenv(is_host("windows") and "HOMEPATH" or "HOME") .. "/.infini")

		local FLASH_ATTN_QY_CUDA_SO_CONTAINER_DEFAULT =

Conversation

xgqdut2016 commented Mar 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

PanZezhong1725 Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

qinyiqun Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

qinyiqun Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

xgqdut2016 commented Mar 20, 2026 •

edited

Loading